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ABSTRACT 


Linear Programming (LP) is used infrequently for routine 
decision-making. Even in situations where LP is an extremely 
attractive tool, there is too much cost, frustration, delay 
and risk incurred in conversion of a mathematical hypothe- 
Sis into a valid LP solution. This report outlines an 
entirely new approach to specifying and generating LP's 
which departs fundamentally from classical methods in an 
ambitious attempt to mitigate their most onerous disadvan- 
tages. These ideas are implemented and tested in a new 
modeling language and software system called LEXICON. Using 
LEXICON, a model is conceived, formulated, specified, ex- 
pressed, internally documented, verified and directly exe- 
cuted in a single form. The LEXICON language is derived 
from a modeling form proposed by Geoffrion. The software 
engineering of the LEXICON system admits expansion of the 
language, portability, and linkage with contemporary real- 


time LP solvers. 
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The goal of this research is to make linear programming 
(LP) models much more reliable and responsive to the needs 
of decision makers. Although models can be critical in 
decision-making [Ref. 1], the actual usage of linear pro- 
Gramming remaims surprisingly dow,e.g.,» [Ref. 2]. A 
principal reason for this paradox is the time lag between 
Ene LOrmulation Of a model and the production of correct, 
optimized results. 

Practical linear programming problems are conceived in 
a form understandable by people, but solved in a form con- 
venient for a computer algorithm. Fourer classifies these 
two Eepresentations as ™modeler's form and algorithm form 
[Ref. 3: pg. 143]. Modeler's form is written by a person 
and is usually expressed in symbolic notation as variables, 
constraints, and objectives. Algorithm form, on the other 
Dower ts Lead by a machine and is typically a variable-by- 
variable listing of the non-zero coefficients of the problem. 
Whereas modeler's form is conceptual and defines an entire 
class of LPs, algorithm form describes a specific instance 
of the problem. Before an LP model can produce solutions for 
use by decision makers, it must be translated from its 


conceptual description to a computationally efficient form. 


The method of translation selected greatly influences the 
timeliness of these solutions. 

The predominant approach to translation is to divide the 
task between the modeler and the computer by writing a 
computer program called a matrix generator. Although this 
method is a vast improvement over manual translation, there 
are substantial difficulties inherent in its use. Matrix 
generators are written in either general purpose programming 
languages or special programming languages designed for 
creating algorithm forms. Since these languages do not 
have the expressive power of pure mathematical notation, the 
relationship between a modeler's form and its matrix 
generator form 1S always abstract and often obscure. Thus, 
a matrix generator requires both internal documentation, 
inherent in any computer program, aS well as extensive 
external documentation of how it represents the modeler's 
f Cage 

Perhaps the greatest difficulty inherent in the creation 
of this intermediate form is the need to verify the matrix 
generator program. A matrix generator is especially diffi- 
cult to debug. Because its output 1s not intended to be 
read by humans, and can be voluminous for a large LP, manual 
inspection SE the list of coefficients produced by the matrix 
generator is neither an efficient nor a reliable means of 
verification. Instead, matrix generators are verified in- 


directly through a series of manual and automated procedures. 


Fourer describes this process in detail [Ref. 3: pg. 148] 
and concludes that 

..-e@ven an erroneous MG [matrix generator] can look 

correct to a person, can generate output that 

passes many diagnostic tests, and can represent 

an LP that has a plausible solution. Thus there 

is normally a non-negligible risk, in the use of 

an MG, that the wrong LP will be generated, solved, 

and analyzed. [Ref. 3: pp. 148-149] 
Hence, considerable human time and computer time can be 
spent to achieve a less than reliable matrix generator. 

Along with documentation and identification, there is 
the unavoidable problem of modification. Whenever the 
modeler's form is changed to correct deficiencies in the 
model or to test a new hypothesis, the corresponding matrix 
generator must be revised. A change to a modeler's form 
can be instituted in a matter of hours; revising, verifying, 
and documenting this change in the matrix generator may 
require days or weeks. In a planning environment, where 
model structure is frequently altered and only a few produc- 
t10n runs are made with each version, modification of the 
matrix generator can become an onerous and persistent task. 
A simpler alternative to matrix generators is to create 

an executable modeler's form. This approach dispenses with 
intermediate forms altogether and makes the computer, not 
the modeler, responsible for the veracity of the modeler's- 
form-to-algorithm-form translation. The concept is straight- 


forward: the modeler writes his modeler's form in a computer 


language designed for modeling; the computer reads this 
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symbolic description of the LP, along with the corresponding 
parameter data, and produces the prescribed algorithm form. 

The executable model approach requires two components: 

a modeling language and a modeling language translator. 

A modeling language is a declarative language that expresses 
the modeler's form in a notation that the computer can inter- 
pret [Ref. 3: pg. 144]. As such, it must satisfy two con- 
flicting sets of requirements. First, it must be convenient 
for people: it must be easy to learn, easy to use, and as 
powerful and flexible as the algebraic notation it is intended 
to replace. Second, it must be understandable to a machine: 
it must have an unambiguous syntax and a notation. compatible 
with ordinary computer hardware. The modeling language 
translator 1S a compiler: it parses the language, interprets 
its expressions, and converts them from their higher-level 
modeler's form into the lower-level form required by the 
solution algorithm. 

Executable linear programming models are in their infancy. 
Research to date has emphasized the development of modeling 
languages which resemble, as much as possible, common modeler's 
forms. These languages enable the modeler to formulate in 
his personal style and then convert his work into an executa- 
ble model. However, the requirement to conceive in one form 
and express in another form persists [Ref. 3: pg. 158]. 

A yet untried alternative is to change the way the modeler 
views his problem so that formulation in a modeling language 


1s both direct and natural. Geoffrion has proposed a theory 


el 


eoemodel building, called structured modeling, that supports 
this approach [Ref. 4]. The merit of his idea is contingent 
on the existence of a modeling language translator capable 
of executing his hypothesized modeler's form. 

The purpose of our research is to evaluate the feasibility 
of such a translator by deSigning and building an operational 
prototype modeling system based on Geoffrion's theory. This 
paper reports the details of that implementation, dubbed 
LEXICON, and the modeling language we have developed. 
Although structured modeling is applicable to models which 
are not LPs, our software has been designed specifically 
for practical linear programming problems. 

Using the LEXICON language, a modeler can conceive his 
model in an immediately executable and internally documented 
form. This symbolic description can be read by a computer, 
processed into algorithm form, solved and its solution returned 
for analysis without manual intervention. Thus, LEXICON 
increases the value of linear programming to decision makers 
by- allowing modifications to planning models to be made in 
minutes rather than days. 

Our research is presented in five chapters. Chapter II 
presents an overview of the theory of structured modeling.. 
Chapter III introduces the modeling language. Chapter IV 
describes the modeling language translator and principal 
system components. Chapter V discusses some of the practi- 
Seeaspeces OF the modeling language and the LEXICON system. 


Chapter VI presents our research conclusions. 
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If. OVERVIEW OF STRUCTURED MODELING 


This chapter 1S a Synopsis of a new theory of model 
building and model expression proposed by Geoffrion [Ref. 
4]. The definitions, concepts and constructs presented 
provide the terms of reference needed to understand the 
modeling language and the illustrative example contained 


in Chapremwali ire 


A. DEFINITIONS AND CO Neri: 

A structured model is composed of a finite collection 
of elements, partitioned into mutually exclusive and exhaus- 
tive element sets called genera, which in turn are organized 
into conceptual units called modules. Structured models are 
comprised of five types of elements: primitive entities 
(pe), compound entities (ce), attributes (a or va), functions 
(f), and tests (t). A primitive entity 1S an assertion of 
the existence of a physical thing or concept about which 
Statements can be made. Each model must have at least one 
element of this type. A compound entity is also an exis- 
tential assertion. It establishes a relation between other 
entities already defined that does not require a value. An 
attribute element has two parts: a tuple of entity elements 
and a unique value associated with that tuple. This value 
has a specified range and may be non-numeric. The fourth 


element type, the function, is identical to the mathematical 


is 


relation of the same name. It is defined by a domain of 
entity element tuples and a rule that associates a unique 
Poitie = anwseme range. The last ellement type, the test; 1s a 
function whose range is the logical values true-false. 

Each element of the model is associated with a unique 
genus (Singular of genera) which is composed of elements of 
only one type. We can thus unambiguously refer to the type 
of a genus. 

All genera, except primitive entity genera, are defined 
in terms of other genera. This defining relationship among 
genera provides the framework of a structured nodal and is 
abstracted in the notion of a calling sequence. The calling 
sequence of a genus notes all the other genera that its 
definition depends on directly. This establishes a relation- 
Ship "calls" (or “is partially defined by") among genera. 
For example, the calling sequence of an attribute genera 
"calls" the genera that it further describes. 

amie ivewenttty genéra (and optionally; ,other genera) 
introduce a named index. Each genus with more than one 
element has, through its calling sequence, an ordered set 
of the named indices. Each element of a genus is identified 
with a unique tuple of specific index values. 

In a structured model the genera are ordered so that 
genera only call previously specified genera. This "define 


before use" or "no forward referencing" ordering is referred 
to in [Ref. 4: ¢. 2.3] as acyclic-preserving. In general, there 


are many orderings satisfying this condition. 
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Genera contiguous within an acyclic-preserving ordering 
can be grouped into a higher conceptual unit called a module. 
Each genus is related to a unique module, this relationship 
is called "part of." Contiguous modules and genera may also 
be grouped into a module by this relationship. Except for 
one distinguished module, called the model module, each 
module and genus must be "part of" one other module. This 
relationship may not contain any cycles. 

The modules and genera together with the réVationsneme 
which both order and connect them may be viewed as forming 
an arborescence with the model module =a root, the genera 
as leaves and the other modules as interior nodes. Since 
in general there are many alternate groupings of genera and 
modules and there are many alternate acyclic-preserving 
orderings, the modeler has a wide latitude in developing his 
model. 

The function performed by a structured model module is 
to organize physical things, specifications and relatvonsiaiee 
among parts of the system being modeled into a conceptual 
structure which facilitates understanding for both the 
modeler and those who will use the model. An important 
feature of a module is that modules are themselves made up 
of still smaller modules. Thus, the modules of a structured 
model form a hierarchy. At the top level is the entire 
model and at the bottom level are the genera and their 


element sets. 


i> 


Bs PR MAY -CONSLERUCTS 

A single model instance may be obtained by enumerating 
all elements and their calling sequences, all attribute 
values, all function and test rules, and the acyclic- 
preserving and module relationships. In the sense of this 
paper, a model with these aforementioned characteristics 
is termed completely-specified. 

A class of similar models is obtained by separating 
the mathematical/conceptual formulation from the elemental 
detail necessary for computation. The symbolic description 
of this class is the modeler's form of structured modeling, 
called a master dictionary. The parameter data, called 
elemental detail, necessary to manipulate the model is 
Gentained in a dictionary extension called an element section. 

A master dictionary has one module paragraph for each 
module and one genus paragraph for each genus. It displays 


all modules and genera in a linear fashion in a way which 


is acyclic-preserving. Each paragraph contains 
1. the name of the module or genus; 
2. an interpretation of the module or genus in natural 


language; and 
3. aif a genus, the element type and other elemental 
information about the genus which may be given without 
enumeration of the individual elements. 
The element section supplements the information provided 


in the genus paragraphs. The amount of detail provided for 


kG 


each paragraph depends on the genus type and the degree of 
specification desired. The final level of specification 
may, for example, be complete except for the values of 
selected attribute elements. This state is referred to as 
A-partial specifications | bhesunspeci fied ellementeruaen— 
'varlables' of conventional models, are referred to as 
variable attributes (va). Notation and syntax for the 
modeling system developed in this thesis are presented 


in the next chapter. 


C. ANALYTIC USES OF A SPREUCTURED MODEL 


Models are used in a wide variety of contexts for differ- 


ing purposes. Many of the ways models are used can be 
summarized by four analytic modes: evaluation, retrieval, 
Satisfaction and optimization. The definitions which appear 


below presume an A-partially or completely specified model 
[Rem. 4: C. \oeuleie 
1. Evaluation: the process of determining the values of 
all function and test elements. If the model is 
A-partially specified, trial values must be given 
for the variable attributes. Evaluation is often 
called ‘Simulation. 
ae Retrieval: the process of answering queries about 
the element detail and conceptual structure of the 
model that do not require evaluation. 
3. Satisfaction: the process of specifying values for 


all variable attributes such that evaluation results 


ill 


in all test elements being true-valued. The variable 


attribute element values specified, called a feasible 


Solucteney Maymeconstertute an end in itself or a 


prelude Ee optimization. 
Optimization: the process by which a feasible 
solution is found that maximizes (or minimizes) the 


value of a selected real-valued function element. 
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Til. ULEATCON STRUCTURED MODELING NOTA ie. 


A. GENERAL 

LEXICON is a prototype structured modeling system for 
linear optimization problems. It accepts as input user- 
specified master dictionary and element section files; 
translates this external representation of the LP into an 
algorithm form compatible with the Brown and Graves XS 
mathematical programming optimizer [Ref. 5]; and submits the 
problem for solution. After an optimized result has been 
obtained, the modeling system reports the values of the 
Objective function and the decision variables in a labelled 
format. In addition to its primary role, LEXICON ofeeme 
two other procedures useful during model development. A 
verification sub-program iS available to locate and identify 
structural and grammatical inconsistencies in a master dic- 
tionary. This file can also be manipulated by a video 
display sub-program to create various file perspectives. 

This chapter presents the prototype modeling language 
designed for the LEXICON modeling system. Its syntax is 
Similar to one hypothesized for computational implementation 
by Geoffrion, but does not include all the executable features 
he envisions. Our design, however, does permit these 
exceptions to be implemented at a later date if warranted by 


computational experience. In order to achieve an executable 


1D 


model, three types of simplifying restrictions have been 
imposed on the modeling language. We will distinguish 
these as design, implementation, and prototype restrictions, 
although to some extent they all overlap. DeSign restrictions 
define the functional limitations of the software. Imple- 
mentation restrictions represent capability that was designed 
for but not implemented. Prototype restrictions represent 
specific restrictions in the prototype (like dimensions on 
arrays, lengths of names, etc.) that can be easily changed. 
Section A deals with the primitive tokens of the modeling 
language. It also introduces the ‘railroad diagram' used 
to document these building blocks and the more complex 
expressions discussed in later sections. Sections B and C 
present the syntax used in the master dictionary. Section 
D covers the makeup of the element section. Finally, Section 
E presents a structured model master dictionary and element 


section for a textbook modeling problem. 


B. LANGUAGE BUILDING BLOCKS 
im  4Charaeter ser 
The character set consists of digits, upper case 
letters, lower case letters and a subset of the special 
characters available in the EBCDIC character code, e.g., 
[Ref. 6]. The full character set is defined by the first 
meewweharct in fagure 3.1. This ‘railroad diagram' is tra- 


versed by entering at the top left-hand corner and exiting 
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character 






Note: “The cirele Containing no symbol 1S —cmomeaminn 


Figure 3.1 Character Set 


from the bottom right. The name above the entry shows the 
language token which the diagram defines. The boxes and 
circles within the chart are like turnstiles: they may only 
be passed through in the direction indicated and then only 
upon ‘payment’ of the required items. The payment required 


for circles and rounded boxes is the characters displayed 


inside them. Rectangular boxes require an entity defined 
elsewhere. The paths between the turnstiles should be 
treated like railroad lines. Shunting backwards past a 


sharp corner in the path is not permitted [Ref. 7: pg. 19]. 
2. Genstants 
The language recognizes both integer and real 


number constants. Real numbers may be expressed in either 


Zi 


decimal or scientific notation. Figure 3.2 defines their 
form. Sign conventions are defined by another construct. 
3. Symbolic Names 


Six types of symbolic names are used to build more 


complex expressions. Figure 3.3 shows the conventions used 
Zoregenus and module mames. Each paragraph name in the 
master dictionary must be unique. Formats for domain indices, 


index function names and an intermediate construct, the 
Semewgciertc Mame, are Gefined in Figure 3.4. The identi- 
Meme tne Sixth name type, 15 shown in Figure 3.5. Table 
3.1 lists the names which are language keywords. Details 
on keyword usage are provided later. Within the above 
limitations, the intention is to allow meaningful ee 
(e.g., mnemonics, abbreviations, etc.) to be used when 
describing and documenting the model. We suggest, for 
example, that genus names beginning with the characters 
Blemeeee Used LOr test genera and that the °$° character 


be included in cost or profit-related genus names. 


Cl aor R DICTIONARY CONSTRUCTS 

Each paragraph of a master dictionary is composed of 
an ordered collection of statements. Module paragraphs have 
pee ecustedimelOrm. “There are several forms of the genus 
paragraph, differing according to the type of elements com- 
prising the genus. This section defines the seven statement 


types used when forming paragraphs. 


ie 


Lnte yer 
eons came 





basic real 
constant 


iNeeoers . 
| constant mt 









integer 


constant | 






real 
constant 


‘ 


\_ [basic real. 
| constant 





a a a a a aa a ce 
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. .Jinteger 
yo constant | : 


ee 


Preise ss 42 Integer and Real Constants. 
Design Rest rilecemomc ee ee 02s 
Single-word Integer, Double- 


word Real 


ae 


genus name 


A 









upper =) 


letter i 
| 


, eo inte 






ed es 
| upper case letter | 


| 


Prototype Restriction: No More Than 10 Characters 


module name 


upper case 
letter 





Prototype Restriction: No More Than ll Characters 


Peele) 3. 3 Genus and Module Names 
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domain index 







' lower case 
\ 
Letee: 







lower case 
letter 


Prototype Restriction: No More Than 5 Charactere 


index function name 


ls inaexb-— aigit L——> 


basic generic name 











genus name 





Sees 


ce ) ‘domain Oe 





Prototype Restriction: No More Than 10 Domain Indices 


Note: generic names without a domain index are 
unindexed. 


Figure 3.4 Domain Index, Index Function Name and 
Basic Generic Name Constructs 
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identifier 





upper case 
“letter 







upper case 
letter 


Prototype Restriction: No More Than 6 Characters 


Bigure 3.5 


Identifier 


Table 3.1 LANGUAGE KEYWORDS 


Usage 


System Arithmetic 
Functions 


Relational Operators 


See Oberacor 


Data Descriptors 


Translator Reserved Word 


words 


ABS, MAX, MIN, MLX, SUM 


Ei GCE eee Gi, Gis NE 


SELECT 


&ATR, &FNX, &SET 


CHECK (not available to 
user) 
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l.  Interpretativen 
Interpretation is non-executable, natural language 
text intended as a comment to increase the documentation 
content of the paragraph. It is the only paragraph state- 
ment used by both modules and genera. Module interpretations 
should convey the sense in which each module unifies 
its constituent modules and/or genera. Genus interpreta- 
tions describe the typical element in the genus. 
2. Index Statement 
The index statement defined in Figure 3.6, introduces 
a distinct domain index for use with the elements of its 
genus paragraph. The values of the domain index are speci- 
fied by an ordered list of identifiers contained in the ele- 
ment section. Each primitive entity genus introduces a 
domain index. These and other genera that introduce domain 


indices for modeling convenience, are said to be directly 


index statement 





domain 
index 


domain 
index 









domain 
index 


Implementation Restriction: Only One Alias Allowed 


Figure 3.6 Index Statement 


au) 


indexed. All other genera that may contain more than one 
element are said to be indirectly indexed by one or more 
domain indices derived from the components of their calling 
Beastie statements. This procedure is explained later. 
Genera that must be singletons (contain only one element) 
are Said to be unindexed. 

Domain indices, whether introduced directly or 
indirectly derived from a calling sequence statement, pro- 
Pete the Aandex tuple component of the basicugenemuc mame 
used to denote a typical element of a genus. When each 
domain index is given a specific identifier value, the basic 
feeeele Name provides a unique element mame. The singleton 
elements of unindexed genera are uniquely identified by their 
genus name alone. 

The index statement is executable and allows a domain 
index alias to be introduced if required by the model. For 
example, if a network were described in a structured model, 
each node could be an element of a NODE genus indexed by the 
domain index ‘'1i' or its specified alias 'j'. If the arcs 
connecting the network nodes are elements of a genus called 
LINK, then LINK(i,j) clearly identifies a typical arc, 
Without introducing a separate genus and domain index to 
distinguish tail nodes from head nodes. 

Weed iiicmocaulence wlatvement 

The calling sequence statement gives the generic 


(typical) calling sequence for an element in its genus 
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paragraph. The generic calling sequence for the typical 
arc, LINK({1,}), would refer toWa typical taii@node, Neeaaae 
and a typical head node, NODE(j), in order to identify its 
position in a network. Thus, the calling sequence statement 
of the LINK genus would contain two NODE components and 
would be written as ' (NODE(1),NODE(j))'. 

Figure 3.7 shows that the calling Sequence statement 
is composed of one OF more genus components. The Sequeme ame. 
a called component, given in Figure 3.8, is founded on the 
basic generic name of the genus called, supplemented by 


special subscript expressions. 


calling sequence 
statement 





genus component 





Pita ea a7 Calling Sequence Statement 


These domain index substitutes allow each component the 
expressive power necessary to represent specific as well 
as multiple elements of its genus. In general, these 
expressions either: 

a) modify the index; 


b) asSign it a fixed value; 
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generic component 


£) 


2) 
3) 


4) 


genus name ) € 
domain index 


index function 


integer constant 







integer constant 





Option chosen instead 


of domain index 'i' Meaning 
= N Select the Nth value after 
(iinet itor pe LOoro .(1f =") 


the value Gr i.) Lf N iS omitted, 
select all values from 1 to the 
ene, (ita ) Gr trom 1 to the 


beginning (if "=") of its index 
set. 

N (an integer) Select the Nth value of 1. 

a dot Select all values of 1. 

index function Select a single value of i 


according to a rule that depends 
on the values of independent 
indices which form the index 
function argument. 


Restriction: No more than 10 domain indices or domain 
index replacements 


Figure 3.8 Generic Component Syntax 
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c) annihilate it by referencing all its identifiers; or 
d) render its value dependent on one or more other 
(independent) indices. 
A complete discussion of the index functional 

dependence introduced by option (d) must be preceded by 
an explanation of exactly how domain indices are determined 
for genera that do not introduce one through an index 
Statement. The domain indices of indirectly-indexed genera 
are specified by a minimum covering of the unreplaced domain 
indices, those modified by option (a) and the independent 
indices of those replaced by option (d) for all components 
of the calling sequence statement. Consider the calling 


sequence statement for the calling genus, '‘'B': 
(A(p; 772) 7 C(t Geese) ) 


The domain indices of 'B' are: ‘p', ‘j', ‘'1i' and '2.°* (Site 
the order assumed is left-to-right in order of appearance 
in the calling sequence statement, the generic name of 
genus ‘B' would” be written” Bieeg,  e 

The term ‘'k(j)' in component 'C(i,k(j))' is one 
instance of the syntax defined in Figure 3.9 for index 
functions. Here, 'j' is the independent index. 'k' is the 
index whose value is functionally dependent on the value 
assigned to 'j.' In general, a single identifier of the index 


range of 'k' is selected according to a rule that depends on 


onl 


mnaex function 


index function 
domain index 
name 


Prototype Restriction: No more than 2 domain indices 


Figure 3.9 Index Function Syntax 


the values of a subset of the calling genus‘ domain 

indices. The exact mapping is specified in the element 
section of the master dictionary. Since an index function 
may be invoked more than once in a single generic calling 
Sequence and also for more than one calling sequence state- 
ment, the notation specifies that the name of the index being 
rendered dependent be followed by a digit if more than one 
dependency is needed in the model. The LEXICON prototype 
allows up to nine distinct functional dependencies to be 
defined on any domain index. 

The calling sequence statement is executable docu- 
mentation for the physical relationships and dependencies 
the model describes. The LEXICON system is designed to 
verify the pre-definition of each genus and domain index in 
the statement, to create the internal representation neces- 


Sary to support index functions introduced and to determine 


ai74 


a default index set from the calling sequence statement 1f 
an index set statement is omitted from the genus paragraph. 
4. Index Set Statement 


The values of the index tuples of a generic name 


constitute a genus index set. The syntax for an index set 
name 1S given in Figure 3.10. Each genus element has a 
unique index tuple in its genus index set. The members of 


the set are ordered according to the order of appearance 
of the identifiers if there is just one domain index. If 
there are multiple indices, the set members are ordered 

lexicographically according to the order of domain index 
appearance in the generic name; this is called the index- 


induced order [Ref. 4: ec. 2.5]. 


index set name 
genus name 


Figure 3.10 Index Set Name 


The index set statement is an executable field 
which limits the size of the index set allowed by the 
generic name of its genus paragraph. There are several 
statement forms, differing according to whether the genus 


is directly-indexed, indirectly-indexed or unindexed. If 
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the genus is directly indexed, an integer entry documents 
the cardinality of the direct index set it introduces. 
Omission implies that the size of the set will be determined 
by enumerating its identifiers in the element section. The 
requirement that unindexed genera specify the integer 'l' as 
their index set statements is an implementation restriction. 

The index set statement of an indirectly-indexed 
genus either refers to a previously specified genus index 
set that possesses the same domain indices in the same order, 
Or provides a rule by which its indirect index set may be 
constructed from other genus index sets. Omission of an 
index set statement asserts that the genus is indexed by 
its default index set. A default index set consists of all 
domain index tuples for which a generic calling sequence is 
'well-defined.' A calling sequence is well-defined if each 
component in the sequence is completed as a valid element 
name. 

The LEXICON design allows an indirect index set to 
be specified in one of two ways. The keyword 'SELECT,' 
followed by an index set name argument, asserts that the set 
is a subset of the identifier-tuples formed by the Cartesian 
product of the domain indices of the named sets. The 
Cartesian product of two sets I and J is the set (denoted by 
ie) Of all pairs (1,j]) such that 1 1S a member of I and 
j is a member of J [Ref. 8: pg. 287]. In the sense of this 


paper, I and J are the direct index sets that range .the 
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domain indices 1 andj. K-tuples (k > 2) for an index set 
with 'k' domain indices can be formed by recursively applying 
this definition. The 'SELECT' option requires that the 
user specify the identifier-tuple of each element of the 
subset in the element section. 

The second way to define an indirect index set is 
as a Cartesian product of Cartesian product derived indirect 
and/or direct sets. If this latter option is preceded by 
'SELECT,' the intention is to define a subset of this 
aggregation. The syntax for the index set statement is 


shown in Figure 3.1l. 


index set 
statement 
index set 


; SELECT | 
name 


integer constant 






- index =) 


name 








Figure 3.11 Index Set Statement 


5. Genus Type Statement 
This executable field specifies the type of elements 
introduced by a genus paragraph. The two-character codes 


used are shown in Figure 3.12. 
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genus type statement 


ba > C2 Ce) Gt CEO 


Pigce 3.2 Genus Type Statement 


6. Range Statement 
The range statement implemented by the LEXICON system 
is a non-executable, optional field. It is intended as a 
comment to increase the information content of attribute 
paragraphs. Since the LEXICON deSign limits attribute 
element values to the real number system, we Suggest uSing 
this field to record the sign, magnitude, or other criteria 
useful in validating the model's data. 
7. Rule Statement 
Rule statements are introduced by function and test 
genus paragraphs. A function genus rule statement 1s a 
meee presston that Cvalwates to a real number for each 
of its elements. For a test genus, the rule statement is 
formed as a relational expression which values its elements 
as 'true' or ‘false’ (see Figure 3.15). 
The choice of an executable, yet natural, syntax for 
the rule statement was a major deSign issue. The language 


developed is an amalgam of conventional mathematical notation 
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and the syntax used by FORTRAN arithmetic expressions.: 
This combination provides technically trained people with a 
familiar method of specifying arithmetic expressions supple- 
mented by powerful LEXICON system functions. These functions 
enable useful operations such as 'sum', or ‘chose the smallest 
valued element' to be iterated over indexed expressions or 
applied to collections of elements. Rule statement syntax 
and system function capabilities are detailed below. 
a. Real Expressions 

The real primary, the basic unit of a real ex- 
pression, is composed of real constants, system function 
calls and complex generic names. Generic names, in this 
context, can be thought of as the array elements used in 
FORTRAN. The generic name format, and the format of an 
intermediate construct, the domain index expression, are 
illustrated in Figure 3.13. Discussion of system function 
calls and their formats is deferred until later. Figure 
3.14 defines the syntax of the real primary and the real 
expression. These last diagrams are adapted from charts 
produced by Day for FORTRAN [Ref. 7: pg. 60]. 

b. Relational Expressions 

Test genus rule statements are relational ex- 
pressions made up of two real expressions, separated by 
a relational operator. The six relational operators and 
the syntax for a relational expression are shown in Figure 


Sener ; 


oF) 


domain index expression 










integer 
Constance 


° 


index 
funcGt1on 







complex generic name 


basic generic name 
genus name 







domain index 
expression 


Prototype Restriction: No more than 10 domain index 
expressions 


Bigure.3.13 Complex Generic Name Syntax 
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real primary 







system function 
call 





complex generic name 








real expression 





fo 
ea 


“Of 








real primary 








Figure 3.14 Real Expression Syntax 
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relational operators 
Cut) CE) DODDS 


relational expression 


real 
expression 





relational 
operator 
real 
expression 






PLgure. 3.9.5 Relational Expression Syntax 


c. System Function Calls 
The prototype provides seven intrinsic functions 
for computation. Three static functions, "MIN', 'MAX' and 
'ABS', operate over fixed collections of arithmetic expres- 
Sions. They provide the capability to choose the smallest 
value or the largest value from a collection and to compute 
the absolute value of a single arithmetic expression. The 


remaining four system functions perform their defined 
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operations by iterating an arithmetic expression using speci- 
fied domain indices from a chosen genus index set. Iterated 


functions provide the capability to find both largest and 


smallest values ('MIN', 'MAX') and to compute the sum or 
product ('SUM', 'MLX') of real expressions over single or 
multiple domain indices. The syntax of each System funepgem 


type is given in Figure 3.16. 


D. MASTER DICTIONARY Pena 

The master dictionary 1S an ordered list of module and 
genus paragraphs. For each module, it 1S required that its 
parts be contiguous in the acyclic-preserving ordering. The 
modular structure is specified by indenting the list of module 
and genera paragraphs: for each module all the module and/or 
genera paragraphs that are "part of" it are indented (to the 
right) an equal amount. 

The indentations and presentation sequence used are 
acyclic and are restricted such that each module/genus defini- 
tion is completed before any other module/genus definition 
which calls or depends upon that definition (i.e., module/ 
genus definitions are both acyclic and individually con- 
tiguous). This indentation and sequence restriction may 
be viewed as a pre-order presentation of the modular struc- 
ture. This restriction can always be satisfied, usually in 
many ways, and endows the language with desirable properties 


not completely discussed in the present paper. Figures 3.17 
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iterated function 






index set 
name 





domain 
index 






Prototype Restriction: No more than ten domain 
indices 


Beatic function 





system function call 


Tterateawrunet1on Static LuUNnceElOn 





Figure 3.16 System Function Call Syntax 
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and 3.18 illustrate the format of a module paragraph and 
the four formats used by genus paragraphs. 

LEXICON is deSigned to accept a master dictionary, pre- 
pared as a 72-column text file, as input. Although it re- 
quires no particular indentation scheme, the software 
fessor lat level —le(eno entire model 1s tevel—0) modules 
and genera begin in column 1, and that the indentation used 
is consistent within levels of the model. If a paragraph 
Peanot be completed in one line, a continuation line may 
be initialized by a continuation symbol, '..', indented at 
the same depth as the first line of the paragraph. fThis 
prototype allows no more than nine continuation lines per 


paragraph. One blank line may be left between adjacent 


paragraphs, if desired. 


E. ELEMENT SECTION FORMAT 

The element section fixes a concrete instance of the 
model contained in the master dictionary by providing values 
for genus index sets, index functions and the elements of 
attribute genera. Index set members are eee as tuples 
of identifiers. Index function mappings and attribute 
elements are specified by an identifier-tuple and a value. 
eo Oram ndex FUNCtIOnN Mapping 1S an identifier 
from the direct index set of the index rendered dependent. 
The value of an attribute element is a real number. Each 


genus paragraph in the model which creates a new index set, 
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creates a new index function, or is an attribute genus, 
induces a separate data requirement. 

LEXICON is designed to accept this data from a 72-column 
disk file, partitioned by deScriptor statements into 
Megerlptor data blocks of different entry formats. 

Each block contributes values to the members of an index 
set, to the mappings of an index function or to the elements 
of an attribute genus. A block must appear in the same 
order as its parent paragraph in the master dictionary. 

The descriptor statement provides an execution-time 
format for the data block it prefaces. It consists of a 
Gesettptor and the mame of the object being specified. The 
descriptors implemented to preface index set, index function 
Mie@mattribute genus names are “&SET’, ‘&FNX', and ‘&ATR', 
respectively. Descriptor statement syntax is defined in 


Heere 3.19. 


descriptor statement 





Figure 3.19 Descriptor Statement 
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LEXICON checks each data block to ensure that the data 
provided is required by the model and is provided in a format 
consistent with the master dictionary. In general, a six- 
character identifier field is reserved for each identifier 
listed in an identifier-tuple or as an index function value. 
The LEXICON prototype expects identifiers to be right- 
justified within their fields. However this presentation 
can be easily changed to left-justification in a production 
version. A real number value is entered by an attribute 
field. Attribute fields may be no more than twelve charac- 
ters long. A typical data entry always begins in column l 


and is an identifier field (see Figure 3.20). 


data entry 


EE=k 


field 






attribute 
field 


Prototype Restriction: No more than ten 
identifier fields 


Figure 3.20 Data Entry Format 


The number of descriptor data blocks needed to completely 


Or A-partially specify a model, can be reduced by uSing an 


intrinsic system option. Rather than specify all the 
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identifier-tuples of an attribute genus' index set twice 
fomee dn the &SET data block of the referenced index set 
and a second time in the attribute genus &ATR data block), 
LEXICON automatically values and orders the tuples of an 
undefined index set from the &ATR data block alone. 

Data entry requirements are also diminished for unin- 
dexed attribute genera and attribute genera that refer to 
certain index Sets. BecauSe an unindexed genus has no 
identifier-tuple, an unindexed attribute genus is valued by 
providing only one real number for its element. Other attri- 
bute genera may be specified by listing only the real number 
values of their elements in two instances. The first instance 
occurs when the genus index set is) inherited l:iwefrom a 
previously defined direct index set. The second occasion 
when the requirement for an identifier-tuple can be relaxed 
is when the genus index set 1S a CartesSlian-indirect index 
eae inthe sense of this paper, a Cartesian-indirect index 
Set is a set formed by the Cartesian product of two or more 
direct index sets or by the use of this Set operation between 
index sets that were originally defined by the Cartesian 
product of direct index sets. When attribute values are 
not prefaced by identifier-tuples, their attribute fields 
begin in column 1. The sequence of presentation within a 
block is assumed to correspond to the index-induced order 


of the genus index set. 
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F, ILLUSTRATIVE EXAMPLE--THE TANGLEWOOD CHAIR MANUFACTURING | 
COMPANY 


This section introduces an A-partially specified struc- 
tured model as a prelude to the modeling scenario presented 
in Chapter V. The scenario is an enrichment of a formulation 
exercise, posed by Jensen and Barnes [Ref. 9: pg. 12] that 


appears below. 


THE TANGLEWOOD CHAIR MANUFACTURING COMPANY 


The Tanglewood Manufacturing Co. has four plants 
located around the country. The fabrication and 
assembly cost per chair and the minimum and maximum 
monthly production for each plant are shown in 

Table 3.2. The company obtains the twenty pounds of 
wood required to make each chair from two suppliers 
who have agreed to supply any amount ordered. In 
return, the company guarantees the purchase of at 
least 8 tons of wood per month from each supplier. 
The cost of wood is $0.10/71b. fem Supplier i eand 
SO030757 1b. irem’ Supplier 2. The’ sh®ecing cese =n 
S/lb. from each supplier to each plant is shown in 
Table 3.3. The chairs are sold in New York, Houston, 


San Francisco, and Chicago. Transportation costs 
in $/chair between the cities and plants are listed 
in Table 3.4. Table 3.5 shows the minimum demand 


that must be satisfied, the maximum demand and the 

selling price for chairs in each city. 
The model required is to be used for optimization. Subject 
to a criterion of minimizing total cost, a policy is needed 
which specifies: | 

1. where and in what quantities the raw materials 


for each plant should be purchased; 


2. the number of chairs to be produced by each plant; 
and 
3. a distribution plan for -eacheewane ce ouLeuL. 
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Table 3.2 


BeBRICATION COST AND PRODUCTION RESTRICTIONS BY PLANT 


Plant Cost Per Chair Max. Production Mane FP cOocuctlon 
iF Sore Wl 500 : 0 
Z 7010 t5o 400 
S 32.0.0 1000 500 
4 4.00 250 250 
Table 3.3 


See iNGweeol FROMPSOURCE TO PLANT 


‘yib. Of Wood Plant l Plant 2 Plant 3 Plant 4 
Wood 1 Or Om Ors) 2 O04 8) ee 
Source 2 O 204 O03 0.02 0.02 

Table 3.4 


TRANSPORTATION COST BETWEEN PLANTS AND CITIES 


sy chair New York Houston San Francisco Chicago 
al 1.00 00 2200 0.00 
2 S00 GC. 00 7.00 a OU 
oo 3 3.00 1.00 5.00 3.00 
4 or010 200 1.00 4.00 
Table 3.5 


SLL oiGeenaecr AND SDEMAND RESTRICTIONS BY CITY 


Selling Price Max. Min. 
Srey Per Chair Demand Demand 
New York 520.00 2000 500 
Houston Rar 400 100 
san Franciso 20.00 S010 500 
Chicago Sel LS (OKe 500 


one) 


A master dictionary and element section for the Tangle- 
wood problem are shown in Appendices A and B, respectively. 
The model is written uSing the notational conventions of this 
chapter and is executable by the LEXICON modeling software. 

It is based on a formulation prepared by Geoffrion for the 
Same exercise [{Ref. 4: c. 2]. 

The modular structure of the model is shown in Figure 
3.21. The leaf nodes of the arborescence are the model 
genera. Each sub-tree forms a distinct conceptual unit. 

Part of the model describes the wood sources (&SDATA); part 
describes plants (&PDATA); part describes transportation 
(&TDATA); part describes the decisions to be made (&DECISIONS) ; 
part describes the volume and cost consequences of decisions 
(&CONSEQ): and part describes the systems material production 
and sales restrictions (&TESTS). 

A restricted preorder traversal of the arborescence yields 
the indentation and presentation sequence of the master 
dictionary. Preorder is restricted such that the successors 
of a module/node are visited in an acyclic sequence with 
respect to calling dependencies. Indentation level of a 
module/node is analogous to the path length from the module/ 
node to the model module or root node. 

The generic structure of the model is based on three 
primitive entity genera (SOURCE, PLANT, CUST) and two compound 
entity genera (IBLINK, OBLINK). These five genera represent 


all the physical elements of the problem. The remaining 
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genera either assign values to these elements (attribute 

and variable attribute genera) or specify rules which relate 
the elements of one genus to another genus to form real-valued 
or relational expressions (function and test genera). The 
intent of each genus is documented in natural language in 

its master dictionary paragraph. 

The element section for this model contains eleven des- 
Criptor data blocks. Three blocks are used to value the 
three domain indices (i,j,k) introduced by the directly- 
indexed genera (SOURCE,PLANT,CUST). The remaining eight 
blocks specify values for attribute genera. Values for OBC 
have been listed without their identifier tuples to demon- 
strate the format option available to attributes indexed by 
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IV. SOFTWARE DESIGN 


Pee DESLGN PRINCIPLES 

Structured modeling is an evolving concept. Although 
a description of a computer environment to support it has 
been written [Ref. 4: c. 3], the desired functional capabilities 
will be revised as the technigue iS practiced. A software 
design for a prototype, therefore, must be robust. It must 
be both extensible or contractible as additional or unneeded 
capabilities are identified. This flexibility has been 
achieved in this design through adherence to four criteria: 
Subroutinization (we hesitate to use the term modularization 
imene Current context), hierarchical structure, information 
hiding, and the creation of a virtual machine. 

Subroutinization is a mechanism for decomposing a 
system into partial system descriptors. It differs from 
the conventional notion of a subroutine as a Single sub- 
program performing one function by its assignment of a task 
responsibility to groups of interdependent sub-programs. 
Subroutinizations include design decisions which must be made 
before work on individual subroutines may begin. This 
approach enhances flexibility by enabling major revisions 
to be made to one subroutine without a need to change others. 
An added benefit of subroutinization is comprehensibility. 
As system-level functions are clearly defined, the whole 


system can be better understood. 
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A hierarchial structure exists in a system, in the sense 
illustrated ‘by Dijkstra [Ref. 10], if a relation may be 
defined between the components of the system and that relation 
is a partial ordering. The relation used in this design is 
"uses or depends upon' as defined by Parnas [Ref. ll]. 

Partial ordering facilitates implementation of a software 
prototype by providing a usable, testable subset of the 

system at each level. “Coded subroutini zations tay pemusces 
before the system is complete, avoiding the problem of 
"nothing works until everything works' encountered in unstruc- 
tured designs. The 'uses' relation is characterized in the 
design by the existence of a large number of single-purpose 
programs on the lowest level of the structure. These utility 
routines simplify the implementation of the upper level pro- 
grams of the system hierarchy. 

A system may be difficult to revise if too many programs 
are written assuming that a particular feature iS present or 
absent. The concept of information hiding is to identify 
those design decisions which are most likely to change as 
the prototype matures, and to cloak them in subroutinizations. 
The changeable features of each subroutine are not disclosed 
by its interface and remain hidden from other system components 
which use it. These access restrictions increase the robust 
nature of the design by hiding details which should not 
affect other parts of the system. Binding design decisions 


are, thereby, postponed. 
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In the sense that a virtual machine is a suite of pro- 
grams which, when combined with a base machine, provide a 
machine which is more convenient to use than the underlying 
machine, a virtual machine has been created to operate on 
the data types defined by the design. This approach avoids 
the problems posed by 'a chain of data transforming com- 
ponents’ [Ref. ll]. In such a chain, each component receives 
its input from a previous component, performs its function, 
and changes the format of the data before passing it to the 
next stage in the process. If one program is no longer needed 
and is removed, the output provided by its predecessor is 


incompatible with the input required by its successor. 
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Two system abstractions are introduced in the following 
paragraphs in order to make what the system does clearer and 
more understandable. A procedural abstraction is presented 
first to express what is done to the master dictionary/element 
Section representation of the model provided by the user. A 
data abstraction is then presented last to explain the internal 
representation of the model created by the system software. 

1. Procedural Abstraction 

The software consists of the seven sub-systems 

depicted in Figure 4.1. A synopsis of the purpose, input 
and output of each sub-system is provided below. Since this 
prototype is designed for linear optimization models, the 


OPTIMIZE sub-program is treated in greater detail. 
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Figure 4.1 LEXICON Procedural Abstraction 


a. DRIVER Sub-program System Control 
This sub-system displays menu options to the 
user. _It accepts as input the device numbers of designated 
input and output files, and coded options selected from its 
menus. 
b. ERROR/WARNINGS Sub-program 
This sub-system accepts as input unique error 
codes produced by the five sub-programs listed below. It 
prints error/warning messages containing these codes and 
displays the file line(s) or arithmetic expression producing 
each error up to the point of error recognition. 
©. DISPLAY Sub -precimam 
The DISPLAY sub-system allows the user to create 


different views of the master dictionary representation. 


a, 


Sub-system options offer the following perspectives of the 
entire master dictionary file or of the model attribute 
paragraphs alone: 

1) view natural language interpretations of each 

paragraph only; 
2) view technical documentation (paragraphs less inter- 
pretation statements) only, or 

3) view paragraph names only. 
DISPLAY performs its responsibilities without creating an 
internal representation of the model. 

Oly (SOOKE ARIUS, (Sib ee ensterenashil 
This sub-systemaccepts the user-designated 
master dictionary file as input and creates an internal 
representation of the model. It ensures that each genus 
paragraph is in correct format and that its executable 
statements are consistent with those introduced by preceding 
paragraphs. When the model is compiled without error, one 
of two alternatives must be chosen. The user may invoke 
the VERIFY module to verify the integrity of the -internal 
representation before loading the model data or the model 
data may be loaded directly. 
e. VERIFY Sub-program 
The VERIFY sub-system reduces each test genus 

rule statement in the internal representation to an arith- 
metic expression composed of variable attributes (variables), 


attributes (data) and real-valued constants. This is done by 
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iteratively replacing each occurrence of a function generic 
name ina test rule statement by the corresponding functional 
rule statement. After reduction is complete, the sub- 
program checks each term and sub-expression of the aggregated 
rule for possible grammatical errors and for agreement with 
the internally represented model. If an error is discovered 
during this task, the module sends a unique error code to 
the ERROR/WARNINGS module and begins verification of the next 
test rule statement. 
f. LOAD DATA Sub-program 

This sub-system accepts aS input a user-designated 
element section file. The data extracted from this file are 
stored in data structures prepared by the COMPILE sub- 
program from the master dictionary. Errors occurring during 
data input are identified by unique error codes and their 
respective file line numbers. After the last data entry has 
been accepted, the sub-system checks that the data requires 
ments of each genus have been met. The user is notified of 
violations found by the ERROR/WARNINGS sub-program. 

Gs. OPTIMIZE SUesproczam 

The OPTIMIZE sub-system performs three primary 
tasks. First, it prompts the user to formulate a linear 
program from rule statements of the test genera (constraints) 
and function genera (candidate objective functions) contained 
in the model. Its second task is to generate a representa- 


tion of the optimization model for the solver. This is done 


Die 


by using the sub-program depicted in Figure 4.2. After the 
last constraint has been generated, the OPTIMIZE sub-system 


invokes the solver and displays the results of the optimization. 
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Figure 4.2 Matrix Generator Sub-programs 


The solver representation of the LP is a matrix 
generated in row form by a four-stage process (Figure 4.2). 
The first stage in the sequence performs the task responsi- 
bilities of VERIFY, and screens flaws from the formulation 
in the following way. Flawed test rule statements are 
reported to the user and voided from the solver matrix. 
If an error is detected in the specified objective function, 
the user is required to replace it with an existing function 


rule statement or quit the LEXICON system. The output 


60 


of the first stage expresses each rule-type in an infix code. 
The second stage of the transformation converts the infix 
code to postfix notation. The third stage retrieves real- 
valued data from the internal structure and evaluates the 
rule in postfix coding to produce the objective functicnmes 
a constraint. The fourth stage has three responsibilities. 
It aggregates multiple instances into a single, real-valued 
coefficient for each variable in the rule; it aggregates 
real-valued constants and data to form a row resource value 
for constraints; and it passes the objective function vector 
or a canonical LrOW VeG@ECr sto thesoouimizome Li Some ocr 
genus is indexed, stages three and four are repeated until 
a constraint has been generated for each test element in 
that genus. 
225 Data Adstrace ron 

LEXICON requires two input files to create a model 
which may be manipulated. The master dictionary file docu- 
ments the conceptual model and is transformed into a machine 
representation that can be checked for internal consistency. 
The element section file completes the model instance by 
specifying the index sets, index functions and attribute 
values necessary to formulate the case desired. 

The internal representation of the conceptual model 
1s composed of genus records, index set records, index 
function records, attribute table mecordsmanadmilicts of 


character data. Each genus paragraph in the master 
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dictionary file produces one genus record. The paragraph-to- 
record mapping may be one-to-many, depending on the genus 
type and whether a new index set or index function has been 
introduced by the paragraph. Each paragraph may produce 
multiple index function records; it may produce at most one 
of each of the remaining record types. 

The components of each record type are shown in 
Tables 4.1 and 4.2. Lists, accessed by special purpose 
programs, are used to store genus names, domain indexes, index 
function names, and rule statements. These are the only 
positions of the genus paragraph which are stored intact. 
Access to the internal representation is restricted by the 
system's virtual machine. 

The techniques used to store the index sets, index 
functions and real-valued data contained in the element 
section file are principal features of the design. Since 
identifier-tuples are components of the two latter data 
types, index set representation and storage will be discussed 
feo st . ” 

The fundamental index set is the direct index set 
introduced by a directly-indexed genus. A direct index set 
has a defined domain index whose values are specified by 

an ordered list of identifiers. The system represents this 
concept by creating an identifier proxy, called an element 
number, that is equal to the list position of each identifier. 


Hence, element numbers for a domain index range from 1 to 
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Table 421 


GENUS AND INDEX SET RECORD DETAIL 


Data Record iyvpe 


Genus 


Index Set 


Be 


C. 


d. 
— 
fie 


Fields 


Character string length of the 
genus name. 


Pointer to the name string in the 
Genus Name List. 


Pointer to the Index Set Record 
referenced by the genus. 


Code designating genus type. 


Character string length of the 
generic type. 


Pointer to the rule string in 
the Genus Rule List. 


Pointer to the Attribute Table 
Record. 


Note: (3) ve, fvare nw tore ae, 
ce, va, and a genera. 
(ji) “Gis nulterorwne, cer 
t and 4 genera. 


Pointer to the Genus Record that 
introduced the index set. 


Code designating the set type. 


Logical field, valued ‘true’ if 
the index set is a Cartesian 
product of the underlying direct 
sets or a direct set itself. 


Set cardinality. 
Set dimension. 


Pointer to the list of tuple num- 
bers computed for each non- 
Cartesian product index set; 

or 
Pointer to the list of identifier 
string lengths for sets that 
introduce an index. 


Pointer to the list of identifier 
tokens. 


Note: (i) ££ is null for index sets 
derived by Cartesian 
products of the under- 
lying direct sets. 

(ii) gis null for index sets 
that do not introduce an 
index. 
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1. 


2. 


Table 4.2 


Pte eB hbeAND SE UNCTIONAL INDEX RECORD DETAIL 


Data Record Type 
Attribute Table 


Functional Index 


Pointer to the Genus Record 
thiatwinkeoadaucedethemattribute 
table. 


Pointer to the Index Set 
Record referenced by the 
attribute table. 


Pioumeietsne (cee ge@ia;Diectcs (alchorr 


Pointer to the Index Set 
Record that provides the 
index function range. 


inde <erunection domain 
aimensione— le 


Pointers to each of the n 
Index Set Record domain 
arguments. 


Pigex, Eunectton Carcinallty. 


Pointer to the list of tuple 
numbers, computed from the 
domain argument, and to the 
range element number mapped 
From that argqumenc. 
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the number of identifiers specified for its direct index set. 
The identifiers of each direct set are explicitly stored 

in a list. Respective element numbers can be derived, as 
needed, from their list sequence. 

More complex index sets are introduced by genus 
paragraphs that are not directly-indexed. Each identifier- 
tuple of an indirectly indexed set may be equivalently 
expressed as a tuple of the element numbers of its components. 
For simplicity, and to save storage, each element number 
tuple is represented as a single integer rather than a 
tuple. The tuple number preserves the index-induced order- 
ing of the tuples of the indirect index set. An additional 
benefit of this scheme is that the tuple numbers of directly 
indexed sets, and indirectly indexed sets that are Cartesian 
products of the underlying directly indexed sets, need not 
be stored. Instead, they are computed as required. This 
distinction is recorded in the index set record logical 
field (see Table 4.1). The tuple numbers of other indirectly 
indexed sets are sorted and stored in alist. The list is 
accessed via a pointer in the index set record. 

Each tuple number is a function of the cardinalitiee 
of the domain indices over which the set is defined. The 
largest tuple number that can be computed for a given set 
1s the product of its index cardinalities. For a multi- 
dimensional set, this number could exceed the largest integer 


representable on a computer oe ~ 1 on an IBM 3033, a number 


OVEE two ba linen). 


G2 


Although this bound does limit the dimension and range of 
index values of indirect sets, we believe that few practical 
problems will be hindered by this restriction. For example, 
a set with nine indices, where each index has ten elements, 
would contain only one billion members. 

Each real-valued data element in the model instance 
1s stored inalist. A block of memory locations, equivalent 
to the cardinality of the index set over which the attribute 
1S defined, iS reserved in the list for each attribute genus 
in the model. Data value storage assignments within a 
Specific attribute genus block are made by converting each 
value's identifier-tuple to an ordinal position within the 
referenced index set. Locations within the block which are 
not asSSigned values retain a unique initialized value. This 
coding alerts the matrix generator module to data elements 
required by the model that remain unspecified. 

Functional index data, like attribute data, 1s stored 
in a list, partitioned into separate blocks of storage for 
each index function defined by the model, Unlike attributes, 
the length of a specific index function block is determined 
by enumeration of the element section file entries provided 
for that record. Two values are stored for each index 
function instance: the tuple number calculated from its 
identifier-tuple component and the element number of its 
identifier value. After the last block entry is stored, 
the block is sorted into an ascending sequence over the 


tuple number entries. 
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eC CAPABILITIES DESIGNED BUT NOT IMPLEMENTED 

Sub-program interfaces have been designed for two addi- 
tional capabilities. These represent advanced features of 
structured modeling that require considerable coding and add 
relatively little to the evaluation goal of the prototype. 

1. Default Index Set 

Two limitations of the implemented prototype are 

the requirements that each genus index set statement be 
written explicitly, and that index sets which are not Cartesian 
products of the underlying direct index sets be enumerated in 
the element section file and explicitly stored. In practice, 
the user may desire to construct an index set for an indirectly 
indexed genus that consists of all possible tuples for which 
the generic calling sequence is well-defined. Geoffrion 
[Ref. 4] has written an algorithm that uses relational 
algebra to calculate the default set. When implemented, this 
feature will free the user from both limitations when the 
Size of an index set is unrestricted. 

2. Filter Index Sets 

The index sets derivable from generic calling 

sequences and index set statements are meant to represent 
the physical relationships which exist in the model. These 
Static relationships are adequately represented by the index 
set types (direct, indirect-Cartesian, indirect-default, 
and indirect-select) previded@by the designe Uhese Seu 


however, are not sufficient to represent all sets needed 
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for computations defined over sets in a richer grammar. 


Consider the generic rule 


Sic oOmmOleek Litei (A(i,k) ) 
where 


= O,s an wimeirect set defined on domain 
indices '1', and '‘'k'; 


-A(i,k) is an attribute or function genus 


indexed by {Q} 


The set {z} is a function of domain index 'i' and may have 

no physical interpretation outside the context of the summa- 
tion. (Geoffrion has suggested that if the set {z} did have 
physical meaning in a Structured model, it Should be formally 
Paesoalced as a2 Compound entity genus.) in general, these 
dynamically contrived sets, Qi ieelibieeie “Gee, sce “GHekiSissy er 
the underlying set, created by a filter or rule defined on 
the domain indices of the underlying set. The software 


design includes the capability to implement filter sets at 


a later date. 
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V. PRACTICAL TAXONOMY 


In this chapter we highlight some of the practical 
aspects of structured models and of the LEXICON system. 
Section A addresses the issue of model revision. Section 
B demonstrates the facility of expressing a general linear 
programming model as an executable, structured equivalent. 
Section C illustrates the LEXICON user interface through an 
optimization session vignette. Section D describes the 


diagnostic features of the modeling software. 


A. REVISING A STRUCTURED MODEL 

A model does not always progress in an, orderly fashion. 
The Brae emre nea concept of his model will often change as 
he gains insight about the inter-relationships and specifi- 
cations of the system he is attempting to abstract. For 
this reason, any computer environment designed for modeling 
must allow the modeler to detail, revise, and, if necessary, 
undo this work. 

A fundamental issue in the design of a structured 
modeling prototype is whether the user should be allowed to 
modify the internal representation of his model without 
changing its external form. The decision not to allow 
internal manipulation simplifies the design and maintains 
the principle that the master dictionary/element section 


files and the internal representation portray the same model. 


ow 


Thus, we have decreed that the model can be modified only 
Pyeciaggind Seve mcdstem dictionary/element section files. 

The organizing framework provided by structured modeling 
enables model flexibility by helping the practitioner codify 
his concept into a hierarchy of conceptual modules. This 
modular structure provides two benefits. First, the model 
is truly separate from the data. Recall the Tanglewood Chair 
Manufacturing Company problem intruduced in Section III.E. 

For example, the number of plants, the number of Sources and 
the real-valued specifications of the Tanglewood model can 

all be changed without modifying the conceptual model expressed 
by the master dictionary. Second, SARS canon ents of 
non-entity genera, and changes which do not violate the order 
of the modular structure are easily accommodated. This latter 
feature is particularly valuable when the real-valued data 
provided must be scaled or aggregated to a more convenient 
iO « 

Two examples are presented to illustrate how revisions 
might be made in practice to an existing structured model. 

An original assumption made when the Tanglewood master dic- 
tionary (Appendix A) was prepared was that all plants required 
20 lbs. of wood to produce one chair. This figure was ex- 


plicitly used in the T:BAL]1 paragraph's rule statement, 
SUM i {SOURCE} (IBFLOW(i,j)) EQ 20.0 * PROD(}3) 


to specify that total wood purchases must match production 


at each plant. Suppose that new machinery has been installed 
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at plant 2 that reduces the amount of scrap produced in the 
manufacturing process. Plant 2 now requires only 18 lbs./ 
chair. How would the model be revised to reflect this change? 
This factor is clearly a real-valued specification, asso- 
ciated with each plant, that belongs conceptually in the 
&PDATA module. Two dictionary edits, followed by one element 
section edit are required to affect the change. The first 


edit introduces the genus attribute paragraph 


PMAT (PLANT(j)) /a/ {PLANT} :: Each PLANT has a unit 
. MATERIAL REQUTREMENT in lbs/chair. 


in the &DATA module in any position after PLANT. This 
placement guarantees that the calling sequence statement and 
index set statement of PMAT are well-defined. The second edit 
revises the T:BAL]1 paragraph's rule statement and calling 


sequence statement: 


T:BAL1 (IBFLOW(-,73),PMAT(j),PROD(j)) /t/ {PLANT} 
; SUM i {SOURCE} (IBFLOW(i,j)) EQ PMAT(3j)* 

-. PROD()) :: Do totale WOOD PURCHASES @iacer 

..PRODUCTION at each PLANT? 


The calling sequence statement now reflects that each element 
of the genus refers to one element of PMAT. The nature of 
this reference is the PMAT(j) term in the rule statement. 
Note that the derived domain index of the genus is still 

' 


j'. The last edit inserts the following data block into 


the Clement sseer1 on mre 


cae 


&ATRA~.PMAT 

Bee lAAnZ 0. 0 
nn 12 or0 
AnaPL3 20e0 


The next change made to the Tanglewood problem demon- 
Strates a transition from the low-level data available from 
the physical system to the high-level data required by the 
model. Suppose that the freight rate from each source to 
each plant, IBC, 1s not available explicitly for each 
transportation link. Instead, each individual rate must 
be computed as the product of a fixed factor and the highway 
mileage from each source to each plant. 

The first step in this transformation is to write a 
genus paragraph for each kind of data. sinee mileage is : 
real-valued speci ti Gamien defined for each inbound transpor- 
tation link IBLINK(I,J), it should be expressed as an IBLINK- 


memated attribute 


MeaMtLES (LBLINK(i,3)) /a/ 1IBLINK} 22 There is an 
INBOUND TRANSPORTATION MILEAGE for each INBOUND 
TRANSPORTATION LINK expressed in miles. 


The fixed factor can be expressed aS an unindexed attribute 
genus or, like IBMILES, can be indexed by IBLINK. The 
later alternative requires that a Separate data entry be 
made for each transportation link. Indexing the paragraph 
would be a preferred approach if the cost factor is likely 


to vary by origin or destination in the future. The unindexed 


es 


paragraph for the cost factor genus is 


IBSRATE (IBLINK) /a/ 1 :: All INBOUND TRANSPORT- 
..- ATION LINKS have the same fixed BASIC INBOUND 
..- FREIGHT RATE in $ per pound of wood. 


IBSRATE's status as an unindexed genus iS apparent by its 
lack of domain indices in its calling sequence statement and 
its index statement's binary value. Both paragraphs would 
be introduced in the &TDATA module after IBLINK but before 
Lee 

The second step in this transformation is to rewrite 


the IBC genus in the format prescribed for a function genus. 


IBC (IBMILES(i,j),IBSRATE) /f£f/ {IBLINK}; IBMILES (i,j) 
..*IBSRATE :: Computed INBOUND FREIGHT RATE for 
..each INBOUND TRANSPORTATION LINK. 


The new function genus inherits its index set {IBLINK} from 
the IBMILES component of its calling sequence statement. 
These three new paragraphs complete the data transition. 
No other paragraphs which call the IBC genus need know that 
its elements are computed rather than supplied as element 
section data. An instance of the new model would be speci- 
fied by removing the data descriptor block provided for 
IBC and inserting two new blocks for the two new attribute 
genera. Block sequence would parallel the sequence used in 
the master dictionary. 

The conceptual hierarchy thus allows changes to be made 


in a module without having any impact on any other module. 


a 


Modular structure, however, 1S not a panacea. If the 
practitioner makes so many changes that his conceptual model 
1s no longer valid, he may be able to recover very little of 


his previous work. 


B. FORMULATING A LINEAR PROGRAM AS A STRUCTURED MODEL 

All abstractions used in the traditional linear programming 
approach [Ref. 12: pg. 34] to model building have structured 
model counterparts. Figure 5.1 exhibits a general linear 
programming model expressed in structured modeling notation. 


The same model in algebraic notation appears below. 


Symbol Definition 
i. Resource 1 in the set of resources I 
J Activity j in the set of activities J 
b Availability of resource i 
- Activity level of activity j 
elias The quantity of resource i consumed by 
ais . er : 
operating activity j at its unit level 
Ce The cost of operating activity j at its 
J unit level 
Objective 
) CX, 
2) 
subject to 
) a : Ii eek 
ij ee 
Sane e J 
j= a) 
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&RDATA :: Certain RESOURCE Data are supplied. 
RESOURCE i /pe/ :: There 1S a list of RESOURes 


B (RESOURCE(i)) /a/ {RESOURCE} :: Every RESOURG@E @iaaeeee 
AVAILABILITY in resource units. 


&ADATA :: Certain ACTIVITY DATA are supplied. 
ACTIVITY j”"/%pe/7 :: There is a list of 24ers 


X (ACTIVITY(j)) /va/ {ACTIVITY} <:: Every ACTIVITY has@an 
operating LEVEL measured in activity units. 


C (ACTIVITY(j)) /a/ {ACTIVITY} :: Every ACTIVITY has its 
UNIT G@ST In S@eecr-aceiviCyeumdc. 


A (RESOURCE(i), ACTIVITY (j))) Jaf SECOURCH ={(ACQrIY Gin: 
There are COEFFICIENTS that indicate the quantity of each 
RESOURCE consumed by operating each ACTIVITY at its unit 
Level. 


&RESULTS :: Certain RESULTS follow from the given DATAVamg 
the choice of ACTIVITY LEVELS. 


OBJS (C,X) /f/..1... SUM 1 ACT yey 3 (Ce = sie 
Operating all ACTIVITIES at thelr LEVELS incurs a 
TOA Or Gees | NGRE@ eae 

T:RES (A(i,.),X,B(1i)) /e7 {RESOURCE (| @SUM ) (A) 503 
X(j)) LE B(i) :: Does the TOTAL CONSUMPTION of each 
RESOURCE fall Within ies AvealEAe rol i 2 


IT:NEG (X(j)) /t/ {ACTIVITY! 7; XG) CE O©.0 =.) Deeste2ae 
ACTIVITY LEVEL obey the non-negativity assumption? 


Figure 5.1 Structured LP Model Master Dictionary 


ges 


For the purposes of this illustration, assume that the 
solution algorithm accepts only minimization problems. The 
first common property of these forms is that both assert 
the existence of fundamental elements called resources and 
activities. Resource and activity elements are represented 
in the algebraic form as the index sets I and J. These 
same elements appear in the structured LP model (Figure 5.1) 
as members of the primitive entity genera RESOURCE and ACTIVITY. 
Each primitive entity genus introduces an index set: 
{RESOURCE} indexed by 'i' and {ACTIVITY} indexed by 'j'. 

The resource levels (b.), cost factors (Ca), Ae cia 1 ty 
levels or decision variables (x) and technological coeffi- 


clients (a..) defined in’Figure 5.1 are real-valued data. 


19 
Each algebraic. symbol represents a quantity or property 
associated with resource elements, activity elements, or 
resource-activity interactions. These real-valued traits 
are manifested in the master dictionary as the attribute 
genera B, C, X and A, respectively. Note that X is speci- 
fied as a variable attribute because the values it imparts 
to the models' activity elements are unknown. 

The mathematical equivalence of the two LP representa- 


tions is confirmed by observing that a rule statement exists 


in the master dictionary for each linear functional in the 


algebraic form. These rule statements are contained in the 
function and test genera paragraphs. The objective function 
1s the unindexed function genus OBJS. Each constraint is 
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an element of the test genus, T:RES. T:RES, like the alge- 
braic constraint, iS indexed by the resource set. The last 
test genus, T:X, specifies in its rule statement that each 
decision variable must be non-negative. Its index set is 
{SOURS | 

Once an appropriate element section has been specified, 
the structured linear programming problem can be executed 
to solve for the X(j) decision variables. The complete 


structured LP problem would be stated as 


Minimize OBJS subject to T:RES, T:NEG, “by schowce oie 


Exactly how a modeler would interact with the LEXICON 
modeling system to execute a structured LP model is explained 


in chnewne as CC ellen. 


C. OPTIMIZATION USING 2h een 

The aim of this section is to sketch how someone would 
interact with LEXICON to solve a specific linear program. 
Assume that the problem is in the same canonical class or 
"schema' as the Tanglewood Chair Company model in Chapter 
III and that an element section file has been prepared to 
specify the desired instance (Appendix A). Since LEXICON 
requires an interactive computing environment, the session 
is presented as a series of 'screens' interspersed with 
commentary. System responses are prefaced with a '*' in 
each mock video display line to distinguish them from the 


responses of the user. For reasons of brevity and clarity, 
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both external files in the scenario are error-free. eo iS 
cussion of LEAICON error procedures is deferred to Section 
De) 
l. Screen 1 
The session begins with the prompts and responses 
shown below. 


Bee er COl— Vi Rol ONO 
Soe e he SOUL PU PEG DEE, BLOk ENTER 6° FOR SCREEN 


6 

PEN TERe MASTER DEETTONARY FILEDEF 

alls 

[ Spier PROCHDURE UEolRED: ENTER 

s "DISPLAY" TO EXAMINE DIFFERENT MASTER 
DICTIONARY VIEWS 

sa ScOMP LEE “lO BxpCULE THE MODEL, OR 

s oa MEO med 

DISPLAY 


The user designates the TANGLEWOOD model, one of several 
validated models he maintains, as the master dictionary. 
For convenience, he routes all session output directly to 
the video terminal rather than create a listing in his disk 
space. 
Zs. ocreen 2 

The "DISPLAY" command entered in Screen 1 allows the 
user to select one of the sixteen master dictionary refor- 
matting alternatives supported by the system. 


Soni Ciwone LRED VIEWss ENTER 


: COpE PRESENTATION 

s i ALL PARAGRAPHS 

Z 2 ATTRIBUTE PARAGRAPHS ONLY 
2 

SSHLEeCieREsoOLUTLON?: ENTER 

S CODE PARAGRAPH FORMAT 

= i GENUS NAMES ONLY 

- Z INTERPRETATIONS ONLY. 
& S OMIT INTERPRETATIONS 
3 

ieee oS. Ib eLINE NUMBERS ARE DESIRED 
nS 
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The perspective selected is shown in Figure 5.2. After 
scanning this dictionary excerpt, the modeler concludes 

that the element section file prepared satisfies the real- 
data needs of the model. As no indentation or paragraph 
length errors were detected, the system issues an error-free 


completion message 
= VIEWS COMP Inne 

and prompts the user to specify the next procedure. 
COMP EEE 


3. Screen 3 
LEXICON responds to the last command by compiling 
the TANGLEWOOD master dictionary into an internal structure 
ready to receive data. No departures from notational conven- 
tion or structural consistency are found. 
* DICTIONARY COMPILED: MODEL IS READY TO ACCEPT DATA 


* SELECT NEXT PROCE DUR ae \ he. 
ms “VERIFY” TO CHECK SYNTAX AND BAS re 


zs PROPERTIES OF FUNCTION AND TEST 
ie GENERA BEFORE LOADING DATA, OR 

Zs "LOAD" TO LOAD THE ELEMENT) sic Tile h eek 
% NOW 


The user decides to skip the intermediate procedure and 
load the model data directly. (If the problem added genus 
paragraphs to the TANGLEWOOD master dictionary or revised 
its executable fields, a prudent user would verify the model 
before loading data.) 

LOAD 


* SPECIFY ELEMENT SECTION Paar p rr 
eZ 


2 


SCOST (SOURCE(i)) /a/ {SOURCE} :: Each SOURCE has a UNIT 
WOOD COST in $/lb. 


SMIN (SOURCE(i)) /a/ {SOURCE} :: Each SOURCE has a MINI- 
HUN WOO Vo URGCHASHE sli) pounds/month. 


Be@ol (PLANIT(])))>/7a7 {PLANT} =: R+ :: Each PLANT has a 
Dik UMP RODUSe MON COSTE in S/7chair. 


PVN SPRANG) Ja7 ;PLANT} + Rt :: Each PLANT has a 
MINIMUM PRe@meeriOoN Jimit in chairs/month. 


PMAX (PLANT(j)) /a/ {PLANT} : R+ :: Each PLANT has a 
MAXIMUM PRODUCTION limit in chairs/month. 


Beech (G@UetCkijyeyay, (CUSP” : R+ :: There is a SEELING 
Mvlet sino eniad ret er each Customer. 


Petit eawmm~m a |CUol; :aRt 2: Hach CUSTOMER has a 
MINIMUM DEMAND in chairs/month. 


DMAX (CUST(k)) /a/ {CUST} : R+ :: Each CUSTOMER has a 
MAXIMUM DEMAND in chairs/month. 


IBC (IBLINK(i,j)) /a/ {IBLINK} :: There is an INBOUND 
FREIGHT RATE for each INBOUND TRANSPORTATION LINK in 
S$ per pound of wood. 

OBC (OBLINK(j,k)) /a/ {OBLINK} :: There is an OUTBOUND 


FREIGHT RATE for each OUTBOUND TRANSPORTATION LINK in 
S per chair. 


Figure 5.2 TANGLEWOOD Attribute Perspective 
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Element detail is read from the element section file to 

fill the requirements of the model. After the data has been 
stored, LEXICON completes the internal structure and 

Signals 


* READY TO GENERATE A LINEAR PROGRAM. 


4. Screen 4 
The next sequence of prompts requires the user to 
specify a linear programming model from the function and 


test genera contained in the master dictionary. 


* SELECT CONSTRAINTS: ENTER 

* "SELECT" TO GENERATE A SUBSET OF THE MODEL 
* TEST GENERA, OR 

ke ei 


"SELECT" offers the user the opportunity to insert or exclude 
families of constraints (test genera) in/from his linear 
programming formulation. Since every test genera is of 


interest, tne 


ALL 


response is entered. : 


* ENTER OBJECTIVE FUNCTION GENUS NAME 


$TOT 

* SPECIFY OPTIMIZATION: ENTER 

* "MAX" TO MAXIMIZE, OR 
* "MIN" TO MINIMIZE 

MIN 


Provision of the linear objective function and the optimi- 
zation mode are the last interactive inputs required by the 


system. 


ou 


>.  Themsetucion 

LEXICON uses the linear programming model and the 
internally stored data to generate the LP in a form compati- 
ble with the optimizer. After the solution is obtained, 
the system stores the calculated values of the model's 
variable attributes in the internal structure and issues 
the report shown in Figure 5.3. Additional optimization 
output is provided by the attached optimizer for each test 


element (constraint). 


Pee LEXICON ERROR PROCEDURES 

LEXICON has extensive features for detecting compilation 
and data input errors. The system is capable of detecting 
over 100 different departures from notational convention 
and structured model properties in a master dictionary file 
alone. When an element section file is submitted to A- 
partially specify a compiled model, over 30 additional data- 
model and format inconsistencies can be identified. Although 
these abilities do not help the user create the ‘right' 
model, they can speed the development of a grammatically 
correct and internally consistent representation of the 
problem. 

The model debugging cycle, diagrammed in Figure 5.4, begins 
when a master dictionary file and element section file are 
initially submitted for processing. It ends when an opti- 
mized solution for the model specified by these files is 


Obtained. To reach the terminal state, the model must pass 
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OBJECTIVE: STOT OPTIMAL VALUE = ~¢-3110E+04 


LISTING OF ATTRIBUTE: IBFLOUW 


INDEX SET: IBLINK SET TYPE: CARTESIAN NO- OF EEENENTS= 9a 
DOMAIN 
NUMBER VALUE INDICES: i J 
L 1-Q0000E+04 L L 
e O-6000E+04 L e 
3 0-0 L 3 
4 0-0 L Y 
S 0-0 a L 
b O-9000E+04 e eS 
? ¢-QOO00E +04 = a 
& O- SO000E+04 e Y 
LISTING OF ATTRIBUTE: PROD 
INDEX SET: PLANT SET EE eee ERE NO. OF ELEMENTS: y 
DOMAIN 
NUMBER VALUE INDICES: J 
L O- SO00E+03 L 
= O. 7?S00E+03 a 
3 O.1L000E+04 3 
Y 0O-2S00E+03 Y 
LISTING OF ATTRIBUTE: OBFLOW 
INDEX SET: OBLINK SET TYPE: CARTESIAN NO- OF ELEMENTS: Ib 
DOMAIN 
NUMBER VALUE INDICES: J K 
L 0-0 L L 
e 0-0 L 2 
3 0-0 L 3 
Y O- S000E+03 L Y 
5 O- ?SOOE+03 a L 
b 0-0 e e 
7 0-0 = 3 
ts) 0-0 c 4 
4 0-0 3 L 
LO 0-0 S 
Li O- LO00E+04 3 3 
Le 0-0 3 4 
13 0-0 4 L 
LY O-cS00E+03 Y c 
15 0-0 Y a 
Lb 0-0 Y Y 


Figumae si5 Tanglewood Solution in LEXICON Format 
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each LEXICON sub-program entered without error. If an 
error occurs, the model is denied access to all subsequent 
LEXICON procedures until the fault is corrected. Due to 
the decision not to allow the user to modify the model's 
internal representation, no run-time corrections can be 
made. Error occurrences require the user to guit the 
LEXICON environment and correct the responsible model file 
before the cycle can be continued. 

Debugging either file is made easier by two system 
diagnostic aids: unambiguous error messages and line- 
numbered master dictionary perspectives. Each error detected 
during compilation or data input is identified by a unigue 
code and located, at a minimum, by the file line number of 
the fault or the name of its genus paragraph. The exact 
format of the error message is both fault and module depen- 
dent. After all error messages have been issued, the system 
offers the user the opportunity to create and browse a 
complementing, line~numbered display prior to leaving the 
system. (Text editor and LEXICON line numbering conventions 
are the same.) When both features are used in concert, 
faults can be diagnosed and located quickly during the correc= 
tion step of the cycle. 

Three eclectic error messages, based on the Tanglewood 
model (Appendices A,B) are presented below. Each has been 
contrived to illustrate a different software error diagnostic 


ability. The first example: 
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41 IBLINK (SOURCE(i),PLANT(j3))) /ce/ {SOURCE} { 

FATAL COMPILATION ERROR 2404 RECOGNIZED AT LAST 

PRINTED CHARACTER 
is representative of the messages produced for errors 
detected during manipulation or compilation of the master 
dictionary file. It contains a file line number label, a 
partially displayed paragraph, and an error code. The para- 
graph display is truncated at the point in the line text 
where the error iS recognized. fThis sae iecre message 
indicates that the IBLINK genus paragraph, located at file 
line 41, contains a fault in its index set statement. 
Errorcode 2404 means that the index set operator, required 
between SOURCE and the beginning of the next index set 
feniere ts missing. (“fhe correct statement is {SOURCE}x{PLANT}.) 

Compilation errors in rule statements are signalled 
using a different format. The example message 

TBAL1 - SUM i {PLANT} (IBFLOW(i) | 

FATAL RULE COMPILATION ERROR 521 RECOGNIZED AT LAST 

PRINTED CHARACTER 
includes a genus name label, the partially displayed rule 
statement of that genus, and an error code. Like the first 
format, the point of truncation is the position where the 
rule was no longer decipherable. A file line number is 
not provided because the rule statement text is 
internally stored. The message instance states that a 
fault has been found in the displayed term of the T:BALI1 


genus rule statement. Error code 521 pinpoints the problem: 
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too few domain index subscripts have been provided for a 
generic name. (The IBFLOW paragraph in Appendix A is 
indirectly indexed by i and j.) 

The third example is characteristic of the messages 
issued for model-data inconsistencies found in the element 
section™riie. 

ELEMENY SECTION Bite nie 4 1. 

CONTAINS ERROR 5324 
Although the format is terse, the information content of the 
message is sufficient to locate the offending line uSing a 
text editor and to orceee the fault. This particular message 
refers to the first data record in the DMAX attribute block 
(see Appendix B). Error code 5324 means that the identifier 
tuple listed on this line is not a member of the index set 
specified for the attribute genus by the master dictionary. 
An instance of this fault would be obtained by replacing 


the correct line 
NY. ol O 

by 
PLT IAs0'0. 0 


LEXICON would detect that PLT] 1S not a member of the set 
(NY,H,SF,C) and issue the cited diagnostic. (NY,H, SF, C) eens 
the set of identifiers inherited by DMAX in its index set 


statement from CUST. (See Appendix B.) 
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ore ec ONCLUS LON 


This paper describes a modeling language and a suite of 
computer programs for obtaining an executable structured 
linear programming model. The prototype software contains 
in excess of 4000 lines of FORTRAN source code, over 50 
percent of which are comments. A very high standard of 
self-documentation has been followed to enable additional 
capabilities to be added later, such as a sophisticated data 
editor and/or report writer. FORTRAN was chosen as the host 
language because of its availability. 

Although the LEXICON system is experimental, we do not 
consider it to be limited to textbook-size problems. The 
language it accepts has symbolic indexing sufficient to 
express large-scale LP models. The data storage required to 
run the system is linear in the size of the parameter data. 
Moreover, the algorithm form produced is submitted to a com- 
mercial-quality optimizer capable of solving mixed-integer 
linear programs (MIPs) as well as traditional LPs. LEXICON 
can be extended to accept MIP formulations by making the 
genus paragraph range statement an executable field. This 
enhancement and the incorporation of extensive data manage- 
ment, and report writing capabilities, e.g., ATHENA [Ref. 13], 


would be essential in a production version of the system. 
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A survey of the literature performed by Fourer [Ref. 3: 
pp. 164-166] and our own review of papers published since 
1981 [{Refs. 14,15] identify fifteen computation-capable 
LP modeling language implementations since 1970. We believe 
that LEXICON embodies more of the characteristics of an 
ideal modeling language [Ret wo? pp. 1725074) chanwany ses 
its predecessors. Its notation is powerful and understanda- 
ble, and its structured modeling framework enforces a model 
organization which can be comprehended by a computer in one 
pass. Another advantage of this form is its acyclic nature: 
the model is guaranteed to be finite and closed. 

There are drawbacks to uSing structured modeling as a 
basis for a modeling language. Implicit in the argument for 
this approach is that modelers will find it worth the .trouble. 
Top-down design, an intrinsic discipline of structured modeling, 
can be useful in dealingewith complexity, but lteis nee 
always a natural way to model. Also, the modular structure 
of a model is more conceptual than operational. Structured 
model modules are context dependent: their interfaces to 
other modules are not simple and preclude their immediate 
use in building other models. 

In summary, the implementation we have achieved is 
ambitious and is ready for a full-scale computational 
evaluation. We hope our work stimulates further developments 
in modeling software which ultimately lead to real-time 


decision-making with linear programming. 
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APPENDIX A 


TANGLEWOOD CHAIR MANUFACTURING COMPANY 
MASTER DICTIONARY 


&@DATA :: WOOD SOURCE DATA 


SOURCE i /pe/ :: There are WOOD SOURCES available. 
SOC sf oOURCH sm me ar ss cOURCE ) 2: Each SOURCE has a UNIT 
WOIOID) 1OOISAL Mab etaeyAllicn 
SMIN (SOURCE(i)) /a/ {SOURCE} :: Each SOURCE has a MINI- 
..MUM WOOD PURCHASE in pounds/month. 
&PDATA :: PLANT DATA 
PLANT j /pe/ :: There are PLANTS that produce wooden 
Ehates- 
PCGst (PEANT(4)))7a7, {PEANT} =: Rt -: Bach PLANT has a 
UNI ReeRODUCT HON IGOST in S/chair. 
Pre aCe AN in e/a7/ Ss PLANT}. 2 Rt =: Each PLANT has a 
oN PRePUCrT@GN litmit inesehairs/menen. 
PMAX (PLANT(j)) /a/ {PLANT} : Rt+ :: Each PLANT has a 
MAXIMUM PRODUCTION limit in chairs/month. 
&CDATA :: CUSTOMER DATA 
CUST k /pe/ There are CUSTOMER CITIES where chairs are 
S Gale 
Ee hOhmecQoime yey ay, {CUST} » Rr :: There is a SELLING 


PeiGk. iNnewoyvenair for each CUSTOMER. 


DMIN (CUST(k)) /a/ {CUST} : R+ :: Each CUSTOMER has a 
MINIMUM DEMAND in chairs/month. 


Pie Clot a/ {CUST} :7R+ :: Each CUSTOMER has a 
-. MAXIMUM DEMAND in chairs/month. 


&TDATA :: TRANSPORTATION DATA 


Peo seinem) > PLANT(3)) /ce/ {SOURCE}x{PLANT} 
There 1s an INBOUND TRANSPORTATION LINK from every 
SOURCE to every PLANT. 
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IBC (IBLINK(i,j)) /a/ {IBLINK} :: There is an INBOUND 
FREIGHT RATE for each INBOUND TRANSPORTATION LINK in 
S per pound of wood. 


OBLINK (PLANT(j),CUST(kK)) /ce/ {PLANT}x{CUST} :: There 
1s an OUTBOUND TRANSPORTATION LINK from every PLANT 
to every CUSTOMER. 


OBC (OBLINK(j,k)) /a/ {OBLINK} :: There is an OUTBOUND 
FREIGHT RATE for each OUTBOUND TRANSPORTATION LINK in 
S per chair: 


&DECISIONS :: Certain DECISIONS must be made. 


IBFLOW (IBLINK(1,])) /va/ {IBLINK} =: R+ 22 wOoOb” PUGH a 
(inbound flows) must be decided: how many pounds of 


wood per month are shipped over each INBOUND TRANS- 
PORTA ERCiie rn Nike 


PROD (PLANT(j)) /va/ {PLANT} : R+ +: PRODUCTION musuaa. 
decided: how many chairs per month each PLANT 
produces. 

OBFLOW (OBLINK(j,k)) /va/7 {OBLEINK} > Re =e eCUStG ae 


SHIPMENTS (outbound flows) must be decided: how many 


chairs per month are shipped over eacheculceune 
TRANSPORTATION LINK. 


&CONSEO ;:: Operating CONSEQUENGHs of Eee lores 
&VOLUME :: VOLUME CONSEQUENCES 

PURTOT (IBFLOW(i,-) /f£/ {SOURCE? | "sure | lEL Tire 
(IBFLOW(1,))).:2 TOTAL PURCHASES tpom sace 
SOURCE in pownds of Fvood™=permenontn. 

SALES (OBFLOW(.,k) 7£7 (CUS?) = Sui CES 
(OBFLOW(j),kK)) :: SALES to each CUSTOMER in 
chairs per month. 

&COSTSaee COs. COhNSEOUBNCES 

WOODS (SCOST,PURTOT)7i7 . 3) sul SeOuUnen = ( 5ee oi 
i.) *PURTOL (1) ) = =) WOOD REO ST ancy Tete 

IBS (IBFLOW, IBC) /£/ 1 ; SUM 275 {IBLINK?: (1TBELOWin 
,j))*(CIBC(i,3)) > +: INBOUND  TRANSEORTATIONS ¢o- tae 
So neice ae 


PRODS (PCOST,PROD) /£/ 1 ; SUM j {PLANT} (PCOST(j)* 
PROD(j)) :: PRODUCTION COST in $/month. 


onl 


OB$ (OBFLOW,OBC) /f/ 1 ; SUM j,k {OBLINK} (OBFLOW(j 
oa ObeigQee) pes CULBOUND TRANSPORTATION COST 
Ehees /Momien- 


REVS (PRICE,SALES) /f£/ 1 ; SUM k {CUST} (PRICE(k)* 
SALES(k)) :: REVENUES in $/month. 


$TOT (WOODS, PRODS,IB$,OB$,REV$) /f/ 1 ; (WOOD$+PROD$S 
+IB$+OB$) - REVS :: TOTAL COST in $/month. 


&TESTS :: The DECISIONS are subjected to certain TESTS. 


eo eon PUR hOr( ty, sMiN(i)) /t/ {SOURCE} ; PURTOT({i) GE 
SMIN(i) :: Do TOTAL PURCHASES satisfy the MINIMUM 
WOOD PURCHASE requirement for each SOURCE? 


ie Oban ERO (ay), EMIN( 7 )) /t/ {PLANT} ; PROD(]) GE PMIN( 
j) :: Does PRODUCTION satisfy the MINIMUM PRODUCTION 
like: form each PLANT? 


oe Ob (PRO), EPMAxX())) /t/ {PLANT} ; PROD(]) LE PMAX( 
: Jj) :: Does PRODUCTION satisfy the MAXIMUM PRODUCTION 
limit for each PLANT? 


oe ome sty briN(k)) /c/ {CUST} ; SALES(k) GE DMIN( 
k) :: Do SALES satisfy the MINIMUM DEMAND requirements 
for each customer? 


abel wl onemotm i PMAX(K)) /t/ {CUST} ; SALES(k) LE DMAX( 
k) :: Do SALES satisfy the MAXIMUM DEMAND requirements 
for each customer? 


ae ee eR how (J ),EROD(7)) /t/ };PLANT} ; SUM 1 {IBLINK} 
.. (IBFLOW(i,j)) EQ 20.0*PROD(j) :: Does total WOOD PUR- 
CHASES match PRODUCTION at each PLANT? 


Pe babe (PROP(}]),OBPLOW(),.)) /t/ {PLANT} ; PROD(j) EO SUM 
k {OBLINK} (OBFLOW(j,k)) :: Does PRODUCTION at each 
PLANT match its total CUSTOMER SHIPMENTS? 


T: IBFLOW (IBLINK(i,j)) /t/ {IBLINK} ; IBFLOW(i,j) GE 0.0 
Do INBOUND FLOWS satisfy non-negativity? 


Poop mnowW (OBLINK(}],K)) /t/ {OBLINK} ; OBFLOW(j,k) GE 0.0 
Do OUTBOUND FLOWS satisfy non-negativity? 
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PePPENDIES. 


TANGLEWOOD CHAIR MANUFACTURING COMPANY 
ELEMENT SECTION 


&KSET SOURCE 


SUP1 
SUPZ 
&ATR SCOST 
> UE dO 
SUPZ Oe 75 
&ATR SMIN 


SUPT eo oo 7 
SuEZ 1600040 
&5SE ieee PLANT 


aad 
ele 
Pils 
PLT4 
&ATR PCOST 
PEE igo 
PLT Zo 
Pio 13.0 
PLT4 4.0 
&ATR PMIN 
PLR nO. 0 
PLT2 400.0 
Pillows CeCe 
ELT4. “25070 
&ATR PMAX 
PLIl «S020 
PEEZ ‘see. 0 


PLIS S1OCCrwse 
PLI4. 250210 


oe oll 
NY 
AH 
ae 
e 
&ATR PRICE 
NY (2070 
Bolo 
Shee 26 
C ykeee 


gS 


&ATR DMIN 


OO OO 0 0 ©® 


Ny = 50070 
esl OO ..0 
SE 00-0 
Gy S160) 0 
&ATR DMAX 
Nv ZO00F 0 
H 400.0 
ee OO. O 
CD OOO 
S&AlER IBC 
SUP PLL 
wet §6P LEZ 
Swed .PLT3 
SUP1 PLT4 
SUEZ PLT 
SUEZ PLIZ 
Sle2 PLS 
SUP2 PLT4 
&ATR OBC 
lee O 
He) 
Zea) 
O20 
0 
S20 
ano 
5. © 
SO 
iO 
5. O 
Sao) 
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